DM825 - Introduction to Machine Learning
Sheet 11, Spring 2013
Exercise 1 – Probability theory
Prove the following rule:

p(x_i | x_{-i}) = p(x_1, …, x_N) / ∑_{x_i} p(x_1, …, x_N)

where x_{-i} = {x_1, …, x_N} ∖ {x_i}.
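Assuming the rule in question is this standard full-conditional identity, one possible route (a sketch, not the fully justified argument the exercise asks for) combines the product rule and the sum rule:

```latex
p(x_i \mid x_{-i})
  = \frac{p(x_i, x_{-i})}{p(x_{-i})}                             % product rule
  = \frac{p(x_1, \ldots, x_N)}{\sum_{x_i} p(x_1, \ldots, x_N)}.  % sum rule in the denominator
```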
Exercise 2 – Naive Bayes
Consider the binary classification problem of spam email in which a
binary label Y ∈ {0, 1} is to be predicted from a feature vector
X = (X_1, X_2, …, X_n), where X_i = 1 if word i is present in the
email and 0 otherwise. Consider a naive Bayes model, in which the
components X_i are assumed mutually conditionally independent given
the class label Y.
- [a] Draw a directed graphical model corresponding to the naive
Bayes model.
- [b] Find a mathematical expression for the posterior
class probability p(Y = 1 | x), in terms of the prior class
probability p(Y = 1) and the class-conditional densities p(x_i | y).
- [c] Now make explicit the parameters of the Bernoulli
distributions for Y and X_i. Call them µ and θ_i, respectively.
Assume a beta distribution as the prior for each of these parameters
and show how to learn them from a set of training data
(y_j, x⃗_j), j = 1, …, m, using a Bayesian approach. Compare this
solution with the one developed in class via maximum likelihood.
(A short code sketch of this Beta–Bernoulli setup follows the list.)
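Not prescribed by the sheet, but as a rough illustration of parts [b] and [c]: a minimal NumPy sketch comparing maximum-likelihood estimates with Beta-posterior means (prior pseudo-counts added to the ML counts), both plugged into the naive Bayes posterior p(Y = 1 | x). The data, the Beta(1, 1) choice, and all variable names are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy training set: m emails, n binary word-presence features.
# This data is randomly generated and purely illustrative.
m, n = 200, 5
y = rng.integers(0, 2, size=m)       # labels y_j in {0, 1}
X = rng.integers(0, 2, size=(m, n))  # feature vectors x_j in {0, 1}^n

a, b = 1.0, 1.0  # Beta(a, b) prior hyperparameters (an assumed choice)

# Maximum-likelihood estimates: relative frequencies of the counts.
mu_ml = y.mean()                                               # p(Y = 1)
theta_ml = np.stack([X[y == c].mean(axis=0) for c in (0, 1)])  # p(X_i = 1 | Y = c)

# Bayesian estimates: means of the Beta posteriors.  A Beta(a, b) prior with a
# Bernoulli likelihood gives a Beta(a + #ones, b + #zeros) posterior, so the
# posterior mean is the ML frequency with prior pseudo-counts added.
mu_b = (a + y.sum()) / (a + b + m)
theta_b = np.stack([(a + X[y == c].sum(axis=0)) / (a + b + (y == c).sum())
                    for c in (0, 1)])

def posterior_y1(x, mu, theta):
    """Naive Bayes posterior p(Y = 1 | x) from the prior p(Y = 1) and the
    class-conditionals p(x_i | y) = theta^x * (1 - theta)^(1 - x)."""
    lik = lambda c: np.prod(theta[c] ** x * (1 - theta[c]) ** (1 - x))
    num = mu * lik(1)
    return num / (num + (1 - mu) * lik(0))

x_new = rng.integers(0, 2, size=n)
print("ML       p(Y=1|x):", posterior_y1(x_new, mu_ml, theta_ml))
print("Bayesian p(Y=1|x):", posterior_y1(x_new, mu_b, theta_b))
```

Note how Beta(1, 1) reproduces Laplace smoothing, and how the posterior means approach the ML frequencies as m grows.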
Exercise 3 – Directed Graphical Models
Consider the graph in Figure 1.
- Write down the standard factorization for the given
graph.
- For what pairs (i, j) does the statement X_i is independent of
X_j hold? (Don’t assume any conditioning in this part.)
- Suppose that we condition on {X_2, X_9}, shown shaded in the
graph. What is the largest set A for which the statement X_1 is
conditionally independent of X_A given {X_2, X_9} holds?
- What is the largest set B for which X_8 is conditionally
independent of X_B given {X_2, X_9} holds?
- Suppose that I wanted to draw a sample from the marginal
distribution p(x_5) = Pr[X_5 = x_5]. (Don’t assume that X_2 and
X_9 are observed.) Describe an efficient algorithm to do so without
actually computing the marginal; one such approach is sketched
below, after Figure 1.
Figure 1: A directed graph.
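The last part is essentially asking for ancestral (forward) sampling. Since Figure 1 itself is not reproduced here, the sketch below uses a small hypothetical DAG in its place; the graph structure, the CPT numbers, and all names are illustrative assumptions. Using the standard factorization p(x) = ∏_i p(x_i | x_pa(i)), sampling each ancestor of X_5 in topological order and then X_5 itself yields an exact draw from the marginal p(x_5), with no marginal ever computed.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical stand-in for Figure 1 (the actual graph is not reproduced):
# parents[i] lists the parents of binary variable X_i, and cpt[i] maps each
# joint parent assignment to p(X_i = 1 | parents).  All numbers are made up.
parents = {1: [], 2: [1], 3: [1], 4: [2], 5: [3, 4]}
cpt = {
    1: {(): 0.6},
    2: {(0,): 0.2, (1,): 0.7},
    3: {(0,): 0.5, (1,): 0.1},
    4: {(0,): 0.3, (1,): 0.8},
    5: {(0, 0): 0.1, (0, 1): 0.4, (1, 0): 0.6, (1, 1): 0.9},
}

def ancestors(v):
    """All ancestors of v in the DAG, found by walking parent links."""
    seen, stack = set(), list(parents[v])
    while stack:
        u = stack.pop()
        if u not in seen:
            seen.add(u)
            stack.extend(parents[u])
    return seen

def sample_marginal(v):
    """Draw one exact sample of X_v from its marginal by ancestral sampling:
    sample only v and its ancestors, in topological (here: numeric) order,
    each from p(x_i | pa(x_i)); every other variable can be ignored."""
    needed = sorted(ancestors(v) | {v})
    x = {}
    for i in needed:
        pa = tuple(x[p] for p in parents[i])
        x[i] = int(rng.random() < cpt[i][pa])
    return x[v]

draws = [sample_marginal(5) for _ in range(10000)]
print("estimated p(X5 = 1) ≈", np.mean(draws))
```

For the independence parts of this exercise, a d-separation checker (for example networkx's d-separation utilities) could be used to verify answers once the actual Figure 1 graph is encoded.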